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(4^  INTRODUCTION 

C-CAM  is  a  cell  adhesion  molecule  of  the  immunoglobulin  supergene  family  (1).  We 
have  recently  shown  that  C-CAM  plays  critical  roles  in  prostate  cancer  initiation  and  progression 
and  that  loss  of  C-CAM  is  an  early  event  in  the  development  of  prostate  cancer  (2).  Although 
tumorigenesis  studies  in  mouse  xenograft  model  have  suggested  the  involvement  of  C-CAM  in 
epithelial  cell  growth  and  differentiation,  the  functional  roles  of  C-CAM  in  normal  prostate 
development,  prostate  homeostasis,  and  prostate  tumorigenesis  remain  unclear.  Towards  the  aim 
of  determining  the  roles  of  C-CAM's  growth  suppressive  activity  in  prostate  growth  and 
tumorigenesis,  we  propose  to  use  gene  targeting  and  embryonic  stem  cell  technologies  to 
generate  C-CAM  knockout  mice.  Specifically,  we  plan  (1)  to  determine  the  roles  of  C-CAM’s 
growth  suppressive  function  in  vivo  by  generating  mice  with  a  targeted  deletion  of  the  C-CAM 
cytoplasmic  domain;  (2)  to  determine  the  roles  of  C-CAM's  growth  suppressive  function  in 
prostate  development  and  tumorigenesis  by  generating  mice  with  a  prostate-specific  knockout  of 
the  C-CAM  cytoplasmic  domain.  The  proposed  work  was  divided  into  two  Tasks  to  be  carried 
out  in  parallel. 

Task  1.  Generate  mice  with  targeted  deletion  of  C-CAM  cytoplasmic  domain  to  determine  the 
roles  of  C-CAM's  growth  suppressive  function  in  vivo  (months  1-30) 

Task  2.  Prostate-specific  loss  of  function  of  C-CAM  gene  in  prostate  (months  7-36) 

Genetic  manipulation  of  mouse  genes  in  vivo  is  a  powerful  approach  for  understanding  the 
function  of  a  gene,  both  during  embryonic  development  and  in  adult  tissues.  This  method 
requires  full  knowledge  of  the  genomic  structure  of  the  gene  of  interest.  Unlike  humans  and 
rats,  which  each  have  one  C-CAMl  (renamed  as  Ceacaml  (3))  gene,  two  Ceacaml-like  genes, 
Ceacaml  and  Ceacam2,  were  identified  in  mice.  This  poses  potential  problems  in  any  attempt 
to  manipulate  these  genes.  First,  if  the  two  genes  have  the  same  function,  deletion  of  one  gene 
may  not  produce  any  phenotypic  alteration.  Second,  if  there  is  no  significant  difference  in  the 
genes'  sequences,  any  genetic  manipulation  may  not  be  specific  for  a  single  gene,  or  it  might  be 
difficult  to  ascertain  which  gene  was  altered.  It  is,  therefore,  essential  to  determine  the 
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complete  sequences  and  expression  profiles  of  these  related  Ceacam  genes  in  the  129sv  mouse 
strain  before  genetic  manipulation  performed.  We,  therefore,  isolated  and  sequenced  the 
Ceacaml  and  Ceacam2  genes  from  a  mouse  129  Sv/Ev  library.  In  addition,  we  also  examined 
the  tissue-specific  and  embryonic  expressions  of  these  mouse  Ceacaml  and  Ceacam2  genes. 

151  BODY  (Progress  report) 

5.1.  Studies  nerformed  under  Task  1 -Generate  mice  with  targeted  deletion  of  C-CAM 
cytoplasmic  Hnmain  to  determine  the  roles  of  C-CAM*s  growth  suppressive  function  in  vivo 
Imonths  1-30) 

5.1.1.  Determine  the  genomic  structure  of  Ceacaml  and  Ceacaml  genes  in  129Sv/Ev  mouse: 

A  129  Sv/Ev  mouse  genomic  library  in  A.LEX  (provided  by  Dr.  Li- Yuan  Yu-Lee,  Baylor 
College  of  Medicine,  Houston,  TX,  U.S.A.)  was  screened  with  the  1.6-kb  full-length  mouse 
Ceacaml  cDNA  (4)  which  had  been  labeled  by  using  the  Klenow  fragment  of  DNA  polymerase 
I,  random  hexanucleotide  primers  and  [a-^^P]  dCTP  (5).  Twenty-four  positive  clones  were 
identified  in  the  initial  screening  of  this  genomic  library.  PCR  with  Oligo  127  (5- 
GTGTCACTCTAGGCTACAGGAAAT-3’)  and  Oligo  1  2  2  (5'- 

GAGGCCAGCTCCATGTCTCTGCTG-3'),  which  are  specific  to  the  5'  region  of  mouse 
Ceacaml  (6),  showed  that  seven  of  the  clones  had  the  N-terminal  Ceacaml  sequence.  Similarly, 
PCR  with  Oligo  120  (5’-GAAGTCTGGCGGATCTGGCTCCTT-3')  and  Oligo  131  (5'- 
TTGAAGTTCAGGACAGTGTATGCG-3')  showed  that  seven  clones  had  the  3'  Ceacaml 
sequence.  These  clones  were  isolated  by  secondary  and  tertiary  screening  and  characterized  by 
restriction  mapping.  The  nucleotide  sequences  of  the  exons  and  introns  were  determined  by 
primer  walking  using  specific  oligonucleotide  primers.  Sequencing  was  performed  by  the  DNA 
Sequencing  Core  Facilities  at  M.  D.  Anderson  Cancer  Center  with  an  automated  fluorescent 
DNA  sequencer  (Applied  Biosystems  Inc.,  Ramsey,  NJ,  U.S.A.). 

The  seven  clones  with  the  5’  of  Ceacaml  had  different  restriction  maps,  as  did  another 
seven  clones  with  the  3’  end.  The  different  restriction  maps  seemed  to  reflect  different  genes. 
Grouping  these  clones  according  to  their  restriction  maps  and  DNA  sequences  revealed  two 
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distinct  sequences,  Ceacaml  and  Ceacam2.  The  overlapping  genomic  clones  that  spanned 
Ceacaml  and  Ceacam2  are  shown  in  Figure  1.  DNA  sequence  analysis  of  both  strands  of  these 
two  genes  revealed  that  Ceacaml  and  Ceacam2  contained  nine  exons  each  and  were 
approximately  18.3  and  24.65  kb,  respectively  (Figure  1). 

The  sizes  of  the  exons  and  introns  and  the  intron/exon  boundary  sequences  for  Ceacaml 
and  Ceacam2  are  shown  in  Table  1  and  2.  The  first  exon,  which  codes  for  the  first  two-thirds  of 
the  signal  sequence,  was  304  bp  in  both  Ceacaml  and  Ceacam2.  In  contrast,  exons  2-5  were 
each  about  300  bp,  and  each  coded  for  one  Ig-like  domain  in  both  Ceacaml  and  Ceacam2.  The 
transmembrane  domain  was  encoded  by  exon  6  and  the  cytoplasmic  domain  by  exons  7-9. 
Consistent  with  the  RNA  splicing  rule  (7),  each  intron  started  with  GT  at  the  5'  end  and  ended 
with  AG  at  the  3'  end  (Table  1  and  2). 

5.1.2.  Comparison  of  Ceacaml  and  Ceacam2  genes 

The  overall  similarity  between  Ceacaml  and  Ceacam2,  including  all  exons  and  introns, 
was  about  79.6%.  The  similarities  between  the  exons  and  introns  of  Ceacaml  and  Ceacam2  are 
shown  in  Table  3.  The  major  differences  between  these  two  genes  were  in  exon  2,  intron  2, 
intron  5,  exon  6,  and  intron  7,  which  were  76.9%,  29.7%,  59.8%,  78.5%,  and  64.5%  similar, 
respectively.  The  nucleotide  and  the  deduced  amino  acid  sequences  of  exon  2  for  Ceacaml  and 
Ceacam2  are  shown  in  Figure  2.  As  shown,  most  of  the  sequence  substitutions  in  exons  2,  which 
coded  for  the  first  Ig  domain,  did  result  in  amino  acid  substitutions  in  Ceacaml  and  Ceacam2. 
Thus,  the  amino  acid  identity  between  the  first  Ig  domains  of  CEACAMl  and  CEACAM2  was 
57.1%,  which  was  significantly  lower  than  the  similarity  of  other  domains.  In  addition, 
insertions  of  418  and  5849  bp  in  intron  2  and  of  1384  and  197  bp  in  introns  5  and  7,  respectively, 
were  found  in  Ceacaml  compared  with  Ceacaml  (Table  3  and  Figure  1C). 

5.1.3.  Expression  of  Ceacaml  and  Ceacaml  messages  in  mouse  tissues 

To  determine  the  tissue-specific  distribution  of  Ceacaml  and  Ceacam2,  we  performed 
semiquantitative  RT-PCR  on  a  panel  of  mouse  cDNAs  from  various  tissues  by  using  Ceacaml 
and  Ccaca/n2-specific  oligonucleotide  pairs.  A  mouse  cDNA  panel  containing  first-strand  cDNA 
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prepared  from  mouse  tissues  and  normalized  for  P-actin  expression  was  purchased  from  OriGene 
Technologies  Inc.  (Rockville,  MD,  U.S.A.)  and  used  to  analyze  the  expression  of  Ceacaml  and 
Ceacam2  in  various  mouse  tissues.  Oligonucleotides  specific  to  exon  2  of  Ceacaml  (Oligo  1 16, 
5'-AATCTGCCCCTGGCGCTTGGAGCC-3',  and  Oligo  179,  5’- 

AAATCGCACAGTCGCCTGAGTACG-3')  and  to  exon  2  of  Ceacam2  (Oligo  117,  5'- 
AATATGATGAAGGGAGTCTTGGCC-3',  and  Oligo  18  0,  5’- 

AAATTGTCCAGTCAGGACCCTACG-3’)  were  used  as  primers  to  detect  specific  messages  for 
Ceacaml  and  Ceacam2  by  PCR.  PCR  cycling  conditions  were  as  follows:  (1)  pre-denaturation, 
94‘'C,  3  min  for  one  cycle,  (2)  denaturation,  94°C,  30  s,  annealing,  60°C,  30  s,  and  extension, 
72°C,  2  min  for  35  cycles  and  (3)  final  extension,  72°C,  5  min.  The  predicted  size  of  the  PCR 
products  was  246  bp.  These  PCR  products  were  analyzed  by  agarose  gel  electrophoresis, 
transferred  onto  a  nylon  membrane  and  hybridized  with  ^^P-labeled  oligonucleotide  probes 
specific  to  Ceacaml  (Oligo  181,  5'-AACACTACGGCTATAGACAAA-3')  and  Ceacam2  (Oligo 
182,  5'-TCTACTACGTCTACAAATGCT-3'),  the  least  similar  exon  (Figure  2A).  The 
specificities  of  these  two  oligonucleotides  were  confirmed  by  Southern  blot  analysis.  As  shown 
in  Figure  3 A,  Oligo  181  hybridized  to  DNA  from  X3  {Ceacaml)  but  not  from  A,  13  {Ceacaml), 
whereas  Oligo  182  only  hybridized  to  X.13  DNA.  Both  Ceacaml  and  Ceacam2  mRNAs  were 
detected,  although  not  in  the  same  tissues.  This  observation  supports  the  notion  that  both 
Ceacaml  and  Ceacam2  are  expressed  in  mouse  (Figure  3B  and  3C).  High  levels  of  Ceacaml 
message  were  detected  in  liver,  small  intestine,  prostate,  and  spleen  (Figure  3B),  similar  to  the 
tissue  expression  pattern  in  humans  (8)  and  rats  (9,10).  Ceacaml  message  was  also  detected  in 
heart,  kidney,  stomach,  muscle,  skin,  and  uterus.  In  contrast,  Ceacaml  messages  were  detected 
only  in  kidney,  testis,  and,  to  a  lesser  extent,  spleen  (Figure  3C).  These  observations 
demonstrated  that  the  Ceacaml  and  Ceacam2  genes  are  differentially  expressed  in  mouse 
tissues. 

5.1.4.  Ceacam2  cDNA 

Although  they  were  detected  in  testis,  kidney  and  spleen,  Ceacam2  messages  may  not 
contain  open  reading  frames  for  protein  translation.  As  testis  expressed  only  Ceacam2,  the 
cDNAs  coding  for  Ceacam2  were  obtained  from  testis  RNA  by  RT-PCR.  RNA  was  prepared 
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from  mouse  testis  by  using  RNAzol  B  (TEL-TEST  Inc.  Friendswood,  TX,  U.S.A.).  The  cDNAs 
coding  for  Ceacaml  were  obtained  from  the  testis  RNA  by  reverse  transcription  (RT)-PCR  with 
Oligo  630  (5'-GAATTCAAGCTTAAGAAGCTAGCAGGCAGCAGAGAC-3’),  which  contains 
nt  -36  to  +1  of  exon  1,  and  Oligo  631  (5'- 

GCGGCCGCCTAATGATGATGATGATGATGATGCTTCTTTTTTACTTCTGAATAAAC- 
3'),  which  is  complementary  to  the  end  of  the  coding  sequence  in  exon  9  plus  seven  histidine 
codons.  RT-PCR  was  performed  with  Oligo  630  and  Oligo  631  according  to  the  procedures 
provided  by  the  manufacturer  (Amersham/Pharmacia  Corp.,  Arlington  Heights,  IL,  U.S.A.).  The 
1-kb  PCR  product  obtained  only  hybridized  to  Ceacam2-s^ec\f\c  Oligo  182,  not  to  Ceacaml- 
specific  Oligo  181,  suggesting  that  the  fragment  codes  for  Ceacam2.  This  result  is  consistent 
with  the  tissue  distribution  of  Ceacam2  (Figure  3C).  The  PCR  product  was  subcloned  into 
pCRII-topo  (Invitrogen,  San  Diego,  CA,  U.S.A.)  and  its  sequence  was  determined  (Figure  4). 
The  mouse  testis  cDNA  contained  exon  1,  2,  5,  6,  8,  and  9,  and  did  have  an  open  reading  frame 
of  273  amino  acids  (Figure  4B). 

5.1.5.  Expression  of  Ceacaml  and  Ceacaml  during  mouse  embryonic  development 

The  expression  of  Ceacaml  and  Ceacam2  was  examined  in  mouse  embryos  at  8.5,  9.5, 
12.5  and  19  days  of  embryonic  development  by  RT-PCR  hybridization  as  described  above. 
Using  Ceacamf -specific  Oligo  181,  we  observed  age-related  differences  in  mRNA  levels:  we 
detected  the  message  at  day  8.5,  it  disappeared  at  day  9.5-12.5,  and  it  reappeared  at  day  19 
(Figure  5A).  In  contrast,  no  hybridization  signal  was  detected  when  Ccacam2-specific  Oligo 
182  was  used.  This  result  suggests  that  the  expression  of  Ceacaml,  but  not  Ceacaml,  is 
developmentally  regulated. 

5.2.  Studies  performed  under  Task  2— Prostate-specific  loss  of  function  of  C-CAM  gene  in 
prostate 

5.2.1. Development  of  gene  targeting  strategy 

Because  targeted  gene  deletion  is  performed  in  the  129sv  mouse,  we  determined  the 
complete  genomic  structure  and  DNA  sequences  of  mouse  Ceacaml  and  Ceacam2  genes  in  a 
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129Sv/Ev  mouse  genomic  library.  We  also  examined  the  expression  patterns  of  these  two  genes 
in  adult  mouse  tissues  and  during  embryonic  development.  Several  conclusions  can  be  made 
from  this  study.  First,  the  Ceacaml  and  Ceacam2  genes  contained  sufficient  sequence 
differences  that  targeted  gene  deletion  specific  to  either  gene  should  be  feasible.  Second, 
functional  redundancy  may  not  be  a  problem  when  only  one  of  these  two  genes  is  deleted 
because  although  Ceacaml  and  Ceacaml  were  highly  homologous,  they  had  different  tissue 
expression  patterns.  Third,  Ceacaml  is  probably  more  important  than  Ceacaml  in  the  mouse 
because  Ceacaml 's  tissue  expression  pattern  was  similar  to  those  of  the  single  Ceacam  genes  in 
humans  and  rats.  Conservation  of  the  expression  profile  among  these  different  species  suggests 
that  Ceacaml 's  function  may  be  essential.  Fourth,  Ceacaml  plays  no  role  in  embryonic 
development;  it  is  not  expressed  in  the  mouse  embryo. 

The  existence  of  the  highly  homologous  second  Ceacam  gene,  Ceacaml,  in  mice  raises 
the  possibility  that  Ceacaml  is  a  pseudogene.  However,  our  results  indicate  that  Ceacaml  is 
probably  not  a  pseudogene  because  Ceacaml  contained  a  complete  set  of  exons  and  introns 
typical  of  a  Ceacam  gene;  Ceacaml  was  transcribed  in  mouse  as  evidenced  by  the  presence  of 
Ceacaml  message  in  several  tissues;  and  Ceacaml  message  contained  an  open  reading  frame  of 
273  amino  acids  in  testis,  as  it  has  been  shown  in  the  CMT-93  mouse  rectal  carcinoma  cells  (1 1). 
These  findings  raise  the  interesting  question  of  Ceacaml’ s  function.  One  of  the  functions  of 
CEACAMl  is  inhibition  of  tumor  growth  (12,13).  This  suggests  that  this  protein  may  play  an 
important  role  in  regulating  cell  growth  and  differentiation.  Structural  and  functional  analyses  of 
rat  CEACAM  1  revealed  that  the  tumor  suppressive  function  requires  a  long  cytoplasmic  domain 
generated  by  alternative  splicing  (13-15).  Because  the  cytoplasmic  domains  of  CEACAMl  and 
CEACAM2  were  identical,  CEACAM2  may  also  have  growth-suppressive  activity.  However, 
the  role  of  CEACAM2  in  testis,  which  is  composed  of  rapidly  dividing  cells  in  spermatogenesis, 
is  not  clear.  Thus,  the  in  vivo  function  of  CEACAM2  may  not  be  revealed  until  its  gene  is 
deleted  in  the  mouse. 

Knocking  out  a  gene  can  have  no  phenotypic  effect  if  related  genes  have  similar 
functions.  We  showed  that  Ceacaml  and  Ceacaml  were  expressed  in  different  tissues  in  the 
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mouse.  Ceacaml  message  was  deteeted  in  tissues  rich  in  epithelial  cells  which  is  consistent  with 
expression  of  the  CEACAMl  homologue  in  rats  and  humans  (16-18).  In  contrast,  Ceacaml 
message  was  abundantly  expressed  in  testis,  which  does  not  express  Ceacaml  message.  In 
addition,  Ceacaml  message  was  undetectable  in  the  mouse  embryo,  whereas  Ceacaml  message 
was  developmentally  regulated.  Thus,  it  appears  that  the  two  mouse  Ceacam  genes  are  not 
functionally  redundant.  However,  we  cannot  rule  out  the  possibility  that  the  loss  of  Ceacaml 
expression  may  upregulate  expression  of  Ceacaml.  These  studies  would  have  to  await  targeted 
gene  deletion  of  these  two  related  genes. 

The  results  of  this  study  provide  important  information  for  designing  gene  targeting 
strategies  for  functional  studies  of  Ceacam  genes,  by  either  deleting  or  introducing  mutations  in 
the  Ceacam  genes  in  the  mouse  germline.  The  differences  in  the  two  DNA  sequences  can  be 
used  to  target  a  specific  gene.  As  Ceacaml  lacked  the  418  and  5849  bp  insertions  in  Ceacaml 
intron  2  and  the  1384  bp  insertion  present  in  Ceacaml  intron  5  (Figure  1C),  gene-targeting 
vectors  containing  introns  2  or  5  of  Ceacaml  or  Ceacaml  could  be  used  to  achieve  selective 
homologous  recombination  in  the  desired  gene.  If  mutant  mice  with  only  one  Ceacam  gene 
deleted  survive,  then  they  can  be  further  crossed  to  generate  mice  deficient  in  both  Ceacam 
genes.  However,  as  Ceacaml  and  Ceacaml  are  both  on  chromosome  7,  it  is  possible  that  they 
are  too  close  to  construct  double  gene  knockout  mice  by  erossing  the  single  gene  knockout  mice. 

5.2.2.  Gene  targeting  strategy 

Based  on  the  information  from  the  genomic  characterization  of  Ceacaml  and  Ceacam2 
gene,  we  decided  to  delete  the  cytoplasmie  domain  of  Ceacaml.  This  strategy  is  due  to  the 
following  reasons:  (1)  The  Ceaeaml  gene  is  functionally  more  important  than  Ceacam2  gene;  (2) 
The  entire  Ceacaml  gene,  which  is  around  20  kb,  is  too  large  to  delete;  (3)  The  cytoplasmic 
domain  of  CEACAM  1  is  critical  for  tumor  suppressor  function.  The  knockout  construct  planned 
for  is  a  conditional  knockout  construct  deleting  exon  7~9.  This  way  we  will  have  the  option  of 
deleting  this  region  both  in  ES  cells  and  in  the  mice.  For  straight  knockout,  which  is  the  goal  of 
Task  1,  we  will  transfect  the  ere  recombinase  expression  plasmid  to  the  ES  cells  and  screen  for 
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ES  cells  that  have  deletions  in  exon  7-9  of  its  Ceacaml  gene  before  injection  the  ES  cells  to 
blastocysts.  For  the  conditional  knockout,  which  is  the  goal  of  Task  2,  the  ES  cells  that  has 
targeting  vector  will  be  injected  to  blastocysts  and  the  Ceacaml  gene  will  be  deleted  in  prostate 
by  crossing  with  transgenic  mouse  carrying  probasin-driven  ere  recombinase.  Therefore,  Task  1 
and  Task  2  are  being  performed  in  parallel. 

KEY  RESEARCH  ACCOMPLISHMENTS 

❖  Isolate  and  complete  sequence  two  closely  related  Ceacam  genes,  i.e.  Ceacaml  and 
Ceacam2,  from  a  mouse  129  Sv/Ev  library. 

❖  Examine  the  tissue-specific  and  embryonic  expression  of  these  mouse  Ceacaml  and 
Ceacani2  genes. 

❖  Design  knockout  strategy  that  allows  both  Taskl  and  Task2  being  carried  out  in  parallel. 

m  REPORTABT.E  OUTCOMES 

1  manuscript  accepted  for  publication  in  Biochem  J.:  Han,  E.,  Phan,  D.,  Lo,  P.,  Poy,  M.  N., 
Behringer,  R.,  Najjar,  S.,  and  Lin,  S.-H.:  Difference  in  tissue-specific  and  embryonic  expression 
of  mouse  Ceacaml  and  Ceacam2  genes. 

181  CONCIvIJSION  We  propose  to  determine  the  roles  of  CEACAM's  growth  suppressive 
activity  in  prostate  growth  and  tumorigenesis,  we  plan  to  use  gene  targeting  and  embryonic  stem 
cell  technologies  to  generate  CEACAM  knockout  mice.  We  have  isolated  and  sequenced  two 
closely  related  Ceacam  genes,  i.e.  Ceacaml  and  Ceacam2,  from  a  mouse  129  Sv/Ev  library.  We 
have  also  examined  the  tissue-specific  and  embryonic  expressions  of  these  mouse  Ceacaml  and 
Ceacam2  genes.  Results  from  this  study  allow  us  to  design  a  gene  targeting  strategy  that  is 
specific  to  Ceacaml  gene  and  also  allow  us  to  perform  both  straight  and  conditioned  knockout  of 
Ceacaml  gene  in  parallel. 
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Table  1  Intro n/exon  boundaries  of  Ceacaml 


Exon 

Size  (bp) 

Exon  3’ 

Intron 

Size  rbr)! 

Exon  5’ 

Exon 

1 

304 

ACAG 

gtaaggagatattcc... 

intron  1 

767 

. .  .tctcttccctcttag 

CCTCA 

Thr  A 

la  Ser 

Exon 

360 

CACC 

gtaagtaattatctg. . . 

intron  2 

1540 

. .  .ctattatctgcacag 

CC  ATA 

/ 

His  P 

ro  He 

Exon 

'X 

285 

ATCT 

gtgagtaacttcttc. . . 

intron  3 

467 

. .  .ttctgtttgctccag 

AT  GGT 

J 

He  T 

yr  Gly 

Exon 

A 

255 

CTTG 

gtaagtggatctctg... 

intron  4 

1727 

. .  .tctgtttgtccacag 

AG  CCA 

Leu  G 

In  Pro 

Exon 

276 

ATAT 

gtgagtgacctcgcc... 

intron  5 

5358 

. .  .ttctttccctgacag 

TTGAC 

lie  P 

he  Asp 

Exon 

121 

GGC  GG 

gtaggacagtctttc... 

intron  6 

796 

...ctcatatttatttag 

G  GGA 

0 

Gly  G1 

y  Gly 

Exon 

7 

53 

CAC  A 

gtaagtaaagccaat. . . 

intron  7 

587 

. .  .ttcttcctcccctag 

AT  CTG 

/ 

His  A 

sn  Leu 

Exon 

8 

32 

AAC  AAG 

gtgagcactgccact. . . 

intron  8 

948 

...ctctcatcctttcag 

GTG  GAT 

Asn  Lys  Val  Asp 


Exon  1180 
9 
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Table  2  Intron/exon  boundaries  of  Ceacaml 


Exon 

Size  ('bp') 

Exon  3’ 

Exon  1 

304 

ACAG 
Thr  A 

gtaaggagatattcc. 

Exon  2 

360 

CAC  A 

His  T 

gtaagtaattctctg. . 

Exon  3 

285 

ATCT 

He  T 

gtgagtaacttcttt. . 

Exon  4 

255 

CTTG 
Leu  G 

gtaagtggatctctg. 

Exon  5 

276 

ATAT 

He  P 

gtgagtgaccttgcc. 

Exon  6 

118 

CGC  TG 
Arg  Tr 

gtaggacagtctttc. . 

Exon  7 

53 

CAC  A 
His  A 

gtaagtaaagccaat. 

Exon  8 

32 

AAC  AAG 
Asn  Lys 

gtgagcactgccact. 

Exon  9 

1180 

Intron 

Size  (bpl 

Exon  5’ 

intron  1 

766 

. .  .tctcttccctcttag 

CC  TCA 
la  Ser 

intron  2 

7807 

. .  .acacagtgcagacag 

CC  CTA 
hr  Leu 

intron  3 

465 

. .  .tcttgtttgctccag 

AT  GGT 
yr  Gly 

intron  4 

1717 

. .  .tctgtttgtccacag 

AG  CCA 
lu  Pro 

intron  5 

6737 

. .  .ttctttccctgacag 

TTGAC 
he  Asp 

intron  6 

798 

. .  .ctcatatttatttag 

GGGA 

P  Gly 

intron  7 

771 

...ctcttcctcccctag 

AT  CTG 
sn  Leu 

intron  8 

973 

...ctctcatattttcag 

GTG  GAT 
Val  Asp 
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Table  3  Intron/exon  lengths  and  similarities  of  Ceacaml  and  Ceacaml 


Exon 

Length 

Ceacaml 

(bp) 

Ceacam2 

Similarity 

Intron 

Length  (bp) 
Ceacaml  Ceacam2 

Similarity 

1 

304 

304* 

88.9% 

1 

767 

766 

97.4% 

2 

360 

360 

76.9% 

2 

1540 

7807 

29.7% 

3 

285 

285 

97.9% 

3 

467 

465 

94.7% 

4 

255 

255 

99.6% 

4 

1726 

1717 

97.9% 

5 

276 

276 

87.0% 

5 

5358 

6737 

59.8% 

6 

121 

118 

78.5% 

6 

796 

798 

98.5% 

7 

53 

53 

100% 

7 

587 

771 

64.5% 

8 

32 

32 

100% 

8 

948 

973 

94.3% 

9 

1180 

1180^ 

86.8% 

Includes  240  bp  of  promoter  and  5'  untranslated  sequence, 
^includes  889  bp  of  3'  untranslated  sequence. 
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FIGURE  LEGENDS 

Figure  1  Structure  of  the  mouse  Ceacaml  and  Ceacaml  genes 

Numbered  open  boxes  indicate  exons.  The  thick  line  connecting  the  exons  represents  introns.  (A) 
The  maps  of  two  overlapping  X  clones  (?i3  and  Xl)  that  contain  Ceacaml.  (B)  The  maps  of  two 
overlapping  X  clones  (X,13  and  A,15)  that  contain  Ceacaml.  The  restriction  maps  are  aligned  with 
the  intron/exon  map.  E,  EcoRI;  H,  Hindlll;  X,  Xbal.  The  enzyme  sites  in  parentheses  are  from  the 
multiple  cloning  region  of  the  Xphage  vector.  (C)  Comparison  of  the  mouse  Ceacaml  and 
Ceacaml  genes. 

Figure  2  Comparison  of  nucleotide  and  protein  sequences  of  Ceacaml  and  Ceacaml  exon  2 
Oligonucleotides  for  PCR  analyses  are  underlined.  Oligonucleotides  for  hybridization  (Oligo  1 8 1 
and  Oligo  182)  are  boxed.  The  nucleotides  are  numbered  with  the  A  of  the  start  ATG  as  the  first 
nucleotide  according  to  the  genomic  sequence.  The  amino  acids  are  numbered  with  the  start 
methionine  (ATG)  as  the  first  amino  acid. 

Figure  3  Expression  of  Ceacaml  and  Ceacaml  messages  in  various  mouse  tissues 

(A)  Specificity  of  Oligos  181  and  182  in  distinguishing  Ceacaml  from  Ceacaml.  (B  and  C) 

Tissue-specific  expression  of  Ceacaml  (B)  and  Ceacaml  (C)  analyzed  by  RT-PCR  and 
hybridization  with  Oligo  181  and  Oligo  182,  respectively.  The  positions  of  the  PCR  products  are 
indicated  by  the  arrows  on  the  right. 

Figure  4  Ceacaml 
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(A)  cDNA  Structure.  The  numbered  boxes  indicate  exons.  The  thick  lines  connecting  the  exons 
represent  introns.  The  dashed  lines  show  the  portions  of  the  exon  sequences  included  in  testis 
Ceacam2  cDNA.  (B)  cDNA  and  protein  sequence. 

Figure  5  Expression  of  Ceacaml  (A)  and  Ceacam2  (B)  messages  during  embryonic 
development 

(A)  Analysis  of  RT-PCR  products  of  Ceacaml  on  an  agarose  gel  and  gene-specific  hybridization 
using  Oligo  181.  (B)  Analysis  of  RT-PCR  products  of  Ceacam2  on  an  agarose  gel  and  gene- 
specific  hybridization  using  Oligo  182.  Marker  sizes  are  indicated  by  thin  arrows  and  the  size  of 
the  PCR  product  is  indicated  by  thick  arrow. 
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Figure  2 


Ceacaml 

Ceacam2 

Ceacaml 

Ceacam2 

Ceacaml 

Ceacam2 

Ceacaml 

Ceacam2 

Ceacaml 

Ceacam2 

Ceacaml 
Cea cam2 


Ceacaml 

Ceacam2 

Ceacaml 

Ceacam2 


Exon  2  (nucleotide) 

843  853  863  873  883  893 

CCTCACTTTTAGCCTCCTGGAGCCCTGCCACCACTGCTGAAGTCACCATTGAGGCTGTGC 


CCTCACTTTTAGCCTCCTGGAGCCCTCCCACCACTGCACAAGTGACTGTTATGGCTTTTC 


840 

850 

860 

870 

880 

890 

903 

913 

923 

933 

943 

953 

CGCCCCAGGTTGCTGAAGACAACAATGTTCTTCTACTTGTTCACAATCTGCCCCTGGCGC 


CACTCCACGCTGCTGAAGGCAACAATGTTATTCTAGTTGTTTACAATATGATGAAGGGAG 


900 

910 

920 

930 

940 

950 

963 

973 

983 

993 

1003 

1013 

ttggagcctttgcctggtacaagggaAacactacggctatagacaaa^aaattgcacgat 

tcttggcctttagctggcacaagggaIctactacgtctacaaatgct^aaattgtacgat 
960  970  980  990  1000  1010 

1023  1033  1043  1053  1063  1073 

ttgtaccaaatagtaatatgaatttcacggggcaagcatacagcggcagagagataatat 


ttgtaacaggcactaataagactataaaagggcctgtacacagtggcagagaggcactat 


1020 


1030 


1040 


1050 


1060 


1070 


1083 


1093 


1103 


1113 


1123 


1133 


acagcaatggatccctgctcttccaaatgatcaccatgaaggatatgggagtctacacac 


acagcaatggatccctgctcatccaaagggtcaccatgaaggatacgggagtctacacaa 
1080  1090  1100  1110  1120  1130 

1143  1153  1163  1173  1183  1193 

tagatatgacagatgaaaactatcgtcgtactcaggcgactgtgcgatttcatgtacacc 


tagaaatgacagatcaagactatcgtcgtagggtcctgactggacaatttcatgtacaca 
1140  1150  1160  1170  1180  1190 


Exon  2  (protein) 


32 


42 


52 


62 


72 


82 


SLLASWSPATTAEVTIEAVPPQVAEDNNVLLLVHNLPLALGAFAWYKGNTTAIDKEIARF 


SLLASWSPPTTAQVTVMAFPLHAAEGNNVILVVYNMMKGVLAFSWHKGSTTSTNAEIVRF 
32  42  52  62  72  82 

92  102  112  122  132 

VPNSNMNFTGQAYSGREIIYSNGSLLFQMITMKDMGVYTLDMTDENYRRTQATVRFHVH 


VTGTNKTIKGPVHSGREALYSNGSLLIQRVTMKDTGVYTIXMTDQDYRRRVLTGQFHVH 
92  102  112  122  132 
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Fig.  3  (B) 
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Fig.  3  (C) 
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141  bp 


Figure  4 

B 

-6 1  ATATCAGGGCAGCCAGGCTTAGCAGTAGTGTTGGAGAAGAAGCTAGCAGGCAGCAGAGAC  0 

1  ATGGAGCTGGCCTCAGCACATCTCCACAAAGGGCAGGTTCCCTGGTTTGGACTACTGCTC  60 
MELASAHLHKGQVPWFG  LLL 

61  ACAGCCTCACTTTTAGCCTCCTGGAGCCCTCCCACCACTGCACAAGTGACTGTTATGGCT  120 
TASLLASWSPPTTAQVTVMA 

121  TTTCCACTCCACGCCGCTGAAGGCAACAATGTTATTCTAGTTGTTTACAATATGATGAAG  180 
FPLHAAE  GNNVILVVYNMMK 

1 8 1  GGAGTCTCGGCCTTTAGCTGGCACAAGGGATCTACTACGTCTACAAATGCTGAAATTGTA  240 
GVSAFSWHKGSTTSTNAEIV 

24 1  CG  ATTTGTAACAGGC  ACTAATAAGACTATAA  AAGGGCCTGTACACAGTGGC  AG  AGAG  AC  A  300 
RFVTGTNKTIKGPVHSGR  ET 

301  CTATACAGCAATGGATCCCTGCTCATCCAAAGGGTCACCATGAAGGATACGGGAGTCTAC  360 
LY  S  NG  S  LLI  QRVTMKDTGVY 

361  ACA  ATAGAAATGACAGATCAAAGCTATCGTCGTAGGGTCCTGACTGGACAATTTCATGTA  420 
T  lEMTDQSYRRR  VLTGQFHV 

42 1  CACAAGCCAGTGACTCAGCCCTCCCTCCAAGTCACCAACACCACAGTCAAAGAACTAGAC  480 
HKPVTQPSLQVTNTTVKELD 

481  TCTGTCACCCTGACCTGCTTGTCGAAAGACCGTCAAGCCCACATCCATTGGATCTTCAAC  540 
SVTLTCLSKDRQAHIHWIFN 

54 1  AATGATACTCTTCTAATCACAGAGAAGATGACAACCTCTCAGGCGGG  ACTCATCCTCAAA  600 
NDTLL  ITEKMT  TSQAGLILK 

60 1  ATAGACCCTATTAAGAGGGAAGATGCCGGCGAGTATCAGTGTGAAATCTCGAATCCAGTC  660 
1  DPIKRE  DAGEYQCE  ISNPV 

66 1  AGCGTCAAGAGGAGCAACTCAATCAAACTGGAAGTAATATTTGACTCAACATATGACATT  720 
SVKR  SNSIKLEVI  FDSTYDI 

72 1  TCAGATGTCCCCATTGCTGTAATCATAACTGG  AGCTGTGGCCGGGGTG  ATTCTAATAGCA  780 
SDVPIAVI  ITGAVAGVILI  A 

78 1  GGGCTGGCATATCGCCTCTGTTCCAGGAAGTCTCGCTGATCTGGCTCCTTCTG  ACAACTC  840 
GLAYRLCSRKSR* 

84 1  TCCTAAC  A  AGGTGG  ATG  ACGTCGC  ATAC  ACTGTCCTG  AACTTC  AATTCCC  AGC  AACCC  A  A  900 

901  CCGGCC  AACTTC  AGCCCCTTCTTCTCCAAGAGCCACAGAAACAGTTTATTCAGAAGT  AAA  960 

961  AAAGAAG 


27 


Ceacaml  Ceacaml 


p  g*s/Oi(jqiua 


I 

I 

I 

I 


< 


in 


P  S'ZiMjquia 


28 


actin 


